Comparing the Quality of Focused Crawlers and of the Translation Resources Obtained from them
Authors
Abstract
Comparable corpora have been used as an alternative to parallel corpora as resources for computational tasks that involve domain-specific natural language processing. One way to gather documents related to a specific topic of interest is to traverse a portion of the web graph in a targeted way, using focused crawling algorithms. In this paper, we compare several focused crawling algorithms by using them to collect comparable corpora on a specific domain. We then compare the evaluation of the focused crawling algorithms with the performance of linguistic processes executed after training on the corresponding generated corpora. We also propose a novel approach to focused crawling that exploits the expressive power of multiword expressions.
Similar resources
The Comparative Effects of Self-assessment and Peer Feedback on Improving Translation Quality
This study investigated the effect of self-assessment and peer-assessment on the quality of students’ translation. Participants of the study were 60 male and female students. They were selected from the senior students studying English Translation and divided into two groups: self-assessment and peer-assessment. The study adopted a pretest-posttest design, and students’ translation quality ...
The Teaching Methods in Translation Courses: Quality, Relevance and Resources
The study was intended to provide a description of the attitudes of English-major students towards the teaching methods in translation courses to find out more about the relevance and quality of methods to the students’ needs, concerning the necessary educational resources provided in the methods of teaching. Accordingly, a multi-item Likert-scale questionnaire containing 32 items was developed bas...
The Correlation of Machine Translation Evaluation Metrics with Human Judgement on Persian Language
Machine Translation Evaluation Metrics (MTEMs) are the central core of Machine Translation (MT) engines as they are developed based on frequent evaluation. Although MTEMs are widespread today, their validity and quality for many languages are still in question. The aim of this research study was to examine the validity and assess the quality of MTEMs from Lexical Similarity set on machine tra...
The Effectiveness of Emotionally-Focused Couple Therapy on Happiness and Quality of Married Life of Both Working Couples
The present study aimed to determine the effectiveness of emotionally-focused couple therapy on happiness and quality of married life of dual-career couples. The method of study was quasi-experimental with a pre-test-post-test design with a control group. The statistical population included all working couples referred to Alborz Counseling Center in Karaj from the second half of October to the ...
Improving Learner Performance in Producing Grammatical Structures
This experimental study examined the effectiveness of using focused and unfocused tasks on Iranian intermediate EFL learners’ performance in producing noun, adjective, and adverb clauses. In addition, the aim of this study was to explore the effects of form-focused instruction and the feedback students received from their teacher after doing focused grammar tasks. Data consisted of the scores of...